Overview
Brought to you by YData
Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 109127 |
| Missing cells | 195239 |
| Missing cells (%) | 8.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 25.1 MiB |
| Average record size in memory | 241.0 B |
Variable types
| Text | 1 |
|---|---|
| Categorical | 8 |
| Numeric | 12 |
high_conf_clean has constant value "1.0" | Constant |
feature_006 is highly overall correlated with feature_014 | High correlation |
feature_008 is highly overall correlated with feature_009 and 1 other fields | High correlation |
feature_009 is highly overall correlated with feature_008 and 1 other fields | High correlation |
feature_010 is highly overall correlated with feature_008 and 1 other fields | High correlation |
feature_014 is highly overall correlated with feature_006 | High correlation |
feature_007 is highly imbalanced (61.7%) | Imbalance |
feature_011 is highly imbalanced (91.7%) | Imbalance |
feature_012 is highly imbalanced (50.8%) | Imbalance |
feature_001 has 12817 (11.7%) missing values | Missing |
feature_002 has 12638 (11.6%) missing values | Missing |
feature_003 has 12638 (11.6%) missing values | Missing |
feature_005 has 2311 (2.1%) missing values | Missing |
feature_007 has 2343 (2.1%) missing values | Missing |
feature_008 has 2452 (2.2%) missing values | Missing |
feature_009 has 2452 (2.2%) missing values | Missing |
feature_010 has 2452 (2.2%) missing values | Missing |
feature_011 has 2336 (2.1%) missing values | Missing |
feature_012 has 4516 (4.1%) missing values | Missing |
feature_013 has 2336 (2.1%) missing values | Missing |
feature_014 has 2355 (2.2%) missing values | Missing |
feature_017 has 11378 (10.4%) missing values | Missing |
feature_018 has 11181 (10.2%) missing values | Missing |
high_conf_clean has 45186 (41.4%) missing values | Missing |
is_cheating has 63941 (58.6%) missing values | Missing |
feature_010 is highly skewed (γ1 = 182.1561448) | Skewed |
user_hash has unique values | Unique |
feature_006 has 18351 (16.8%) zeros | Zeros |
feature_008 has 31159 (28.6%) zeros | Zeros |
feature_009 has 71139 (65.2%) zeros | Zeros |
feature_010 has 51479 (47.2%) zeros | Zeros |
feature_016 has 11378 (10.4%) zeros | Zeros |
feature_018 has 2012 (1.8%) zeros | Zeros |
Reproduction
| Analysis started | 2026-01-17 18:17:45.091510 |
|---|---|
| Analysis finished | 2026-01-17 18:18:27.882821 |
| Duration | 42.79 seconds |
| Software version | ydata-profiling vv4.17.0 |
| Download configuration | config.json |
Variables
user_hash
Text
Unique
| Distinct | 109127 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.4 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 16 |
| Min length | 16 |
Unique
| Unique | 109127 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | d558f000c89bed2a |
|---|---|
| 2nd row | 1e9668e66b1d6253 |
| 3rd row | c0e9e1e943261b5d |
| 4th row | df3d93556a7b915d |
| 5th row | 302265f9f7d11aa1 |
| Value | Count | Frequency (%) |
| 2ee0108fc5b82c63 | 1 | < 0.1% |
| f0028eb1ed639989 | 1 | < 0.1% |
| d558f000c89bed2a | 1 | < 0.1% |
| 1e9668e66b1d6253 | 1 | < 0.1% |
| c0e9e1e943261b5d | 1 | < 0.1% |
| df3d93556a7b915d | 1 | < 0.1% |
| 302265f9f7d11aa1 | 1 | < 0.1% |
| 41e3aad8953a1b39 | 1 | < 0.1% |
| f643939c4656c10f | 1 | < 0.1% |
| 789d73d2194a4e66 | 1 | < 0.1% |
| Other values (109117) | 109117 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 109826 | 6.3% |
| 7 | 109793 | 6.3% |
| 1 | 109735 | 6.3% |
| a | 109361 | 6.3% |
| 2 | 109284 | 6.3% |
| d | 109097 | 6.2% |
| c | 109095 | 6.2% |
| f | 109028 | 6.2% |
| 0 | 109015 | 6.2% |
| 6 | 108995 | 6.2% |
| Other values (6) | 652803 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1746032 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 5 | 109826 | 6.3% |
| 7 | 109793 | 6.3% |
| 1 | 109735 | 6.3% |
| a | 109361 | 6.3% |
| 2 | 109284 | 6.3% |
| d | 109097 | 6.2% |
| c | 109095 | 6.2% |
| f | 109028 | 6.2% |
| 0 | 109015 | 6.2% |
| 6 | 108995 | 6.2% |
| Other values (6) | 652803 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1746032 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 5 | 109826 | 6.3% |
| 7 | 109793 | 6.3% |
| 1 | 109735 | 6.3% |
| a | 109361 | 6.3% |
| 2 | 109284 | 6.3% |
| d | 109097 | 6.2% |
| c | 109095 | 6.2% |
| f | 109028 | 6.2% |
| 0 | 109015 | 6.2% |
| 6 | 108995 | 6.2% |
| Other values (6) | 652803 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1746032 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 5 | 109826 | 6.3% |
| 7 | 109793 | 6.3% |
| 1 | 109735 | 6.3% |
| a | 109361 | 6.3% |
| 2 | 109284 | 6.3% |
| d | 109097 | 6.2% |
| c | 109095 | 6.2% |
| f | 109028 | 6.2% |
| 0 | 109015 | 6.2% |
| 6 | 108995 | 6.2% |
| Other values (6) | 652803 |
feature_001
Categorical
Missing
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 12817 |
| Missing (%) | 11.7% |
| Memory size | 7.0 MiB |
| 1.0 | |
|---|---|
| 4.0 | |
| 5.0 | |
| 2.0 | |
| 3.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 3.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 33921 | |
| 4.0 | 29544 | |
| 5.0 | 21575 | |
| 2.0 | 6547 | 6.0% |
| 3.0 | 4723 | 4.3% |
| (Missing) | 12817 | 11.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 33921 | |
| 4.0 | 29544 | |
| 5.0 | 21575 | |
| 2.0 | 6547 | 6.8% |
| 3.0 | 4723 | 4.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 96310 | |
| 0 | 96310 | |
| 1 | 33921 | 11.7% |
| 4 | 29544 | 10.2% |
| 5 | 21575 | 7.5% |
| 2 | 6547 | 2.3% |
| 3 | 4723 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 288930 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| . | 96310 | |
| 0 | 96310 | |
| 1 | 33921 | 11.7% |
| 4 | 29544 | 10.2% |
| 5 | 21575 | 7.5% |
| 2 | 6547 | 2.3% |
| 3 | 4723 | 1.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 288930 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| . | 96310 | |
| 0 | 96310 | |
| 1 | 33921 | 11.7% |
| 4 | 29544 | 10.2% |
| 5 | 21575 | 7.5% |
| 2 | 6547 | 2.3% |
| 3 | 4723 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 288930 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| . | 96310 | |
| 0 | 96310 | |
| 1 | 33921 | 11.7% |
| 4 | 29544 | 10.2% |
| 5 | 21575 | 7.5% |
| 2 | 6547 | 2.3% |
| 3 | 4723 | 1.6% |
feature_002
Real number (ℝ)
Missing
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 12638 |
| Missing (%) | 11.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.2640923 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 3 |
| Q3 | 7 |
| 95-th percentile | 9 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.6792322 |
|---|---|
| Coefficient of variation (CV) | 0.62832417 |
| Kurtosis | -1.3163001 |
| Mean | 4.2640923 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.55755389 |
| Sum | 411438 |
| Variance | 7.1782854 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 41032 | |
| 8 | 11707 | 10.7% |
| 7 | 10233 | 9.4% |
| 3 | 9642 | 8.8% |
| 9 | 7052 | 6.5% |
| 5 | 4735 | 4.3% |
| 6 | 4453 | 4.1% |
| 1 | 3806 | 3.5% |
| 4 | 3466 | 3.2% |
| 10 | 363 | 0.3% |
| (Missing) | 12638 | 11.6% |
| Value | Count | Frequency (%) |
| 1 | 3806 | 3.5% |
| 2 | 41032 | |
| 3 | 9642 | 8.8% |
| 4 | 3466 | 3.2% |
| 5 | 4735 | 4.3% |
| 6 | 4453 | 4.1% |
| 7 | 10233 | 9.4% |
| 8 | 11707 | 10.7% |
| 9 | 7052 | 6.5% |
| 10 | 363 | 0.3% |
| Value | Count | Frequency (%) |
| 10 | 363 | 0.3% |
| 9 | 7052 | 6.5% |
| 8 | 11707 | 10.7% |
| 7 | 10233 | 9.4% |
| 6 | 4453 | 4.1% |
| 5 | 4735 | 4.3% |
| 4 | 3466 | 3.2% |
| 3 | 9642 | 8.8% |
| 2 | 41032 | |
| 1 | 3806 | 3.5% |
feature_003
Real number (ℝ)
Missing
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 12638 |
| Missing (%) | 11.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.2237561 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 5 |
| median | 7 |
| Q3 | 8 |
| 95-th percentile | 9 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.9666429 |
|---|---|
| Coefficient of variation (CV) | 0.31598971 |
| Kurtosis | -0.39345729 |
| Mean | 6.2237561 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.57513346 |
| Sum | 600524 |
| Variance | 3.8676841 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 21541 | |
| 6 | 17060 | |
| 7 | 17012 | |
| 5 | 10653 | |
| 4 | 10423 | |
| 9 | 9663 | |
| 3 | 6009 | 5.5% |
| 2 | 2385 | 2.2% |
| 1 | 1711 | 1.6% |
| 10 | 32 | < 0.1% |
| (Missing) | 12638 |
| Value | Count | Frequency (%) |
| 1 | 1711 | 1.6% |
| 2 | 2385 | 2.2% |
| 3 | 6009 | 5.5% |
| 4 | 10423 | |
| 5 | 10653 | |
| 6 | 17060 | |
| 7 | 17012 | |
| 8 | 21541 | |
| 9 | 9663 | |
| 10 | 32 | < 0.1% |
| Value | Count | Frequency (%) |
| 10 | 32 | < 0.1% |
| 9 | 9663 | |
| 8 | 21541 | |
| 7 | 17012 | |
| 6 | 17060 | |
| 5 | 10653 | |
| 4 | 10423 | |
| 3 | 6009 | 5.5% |
| 2 | 2385 | 2.2% |
| 1 | 1711 | 1.6% |
feature_004
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1011 |
| Missing (%) | 0.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.1103167 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 3.6891183 |
|---|---|
| Coefficient of variation (CV) | 0.72189622 |
| Kurtosis | -1.8044433 |
| Mean | 5.1103167 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -0.038256031 |
| Sum | 552507 |
| Variance | 13.609594 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 40888 | |
| 9 | 22816 | |
| 8 | 12906 | 11.8% |
| 10 | 9530 | 8.7% |
| 7 | 8551 | 7.8% |
| 2 | 5844 | 5.4% |
| 6 | 3171 | 2.9% |
| 3 | 2092 | 1.9% |
| 5 | 1608 | 1.5% |
| 4 | 710 | 0.7% |
| (Missing) | 1011 | 0.9% |
| Value | Count | Frequency (%) |
| 1 | 40888 | |
| 2 | 5844 | 5.4% |
| 3 | 2092 | 1.9% |
| 4 | 710 | 0.7% |
| 5 | 1608 | 1.5% |
| 6 | 3171 | 2.9% |
| 7 | 8551 | 7.8% |
| 8 | 12906 | 11.8% |
| 9 | 22816 | |
| 10 | 9530 | 8.7% |
| Value | Count | Frequency (%) |
| 10 | 9530 | 8.7% |
| 9 | 22816 | |
| 8 | 12906 | 11.8% |
| 7 | 8551 | 7.8% |
| 6 | 3171 | 2.9% |
| 5 | 1608 | 1.5% |
| 4 | 710 | 0.7% |
| 3 | 2092 | 1.9% |
| 2 | 5844 | 5.4% |
| 1 | 40888 |
feature_005
Real number (ℝ)
Missing
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2311 |
| Missing (%) | 2.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.3750562 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 7 |
| 95-th percentile | 9 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.7139017 |
|---|---|
| Coefficient of variation (CV) | 0.62031242 |
| Kurtosis | -1.4140472 |
| Mean | 4.3750562 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.2990346 |
| Sum | 467326 |
| Variance | 7.3652624 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 27540 | |
| 7 | 15827 | |
| 1 | 14963 | |
| 3 | 10926 | 10.0% |
| 8 | 10295 | 9.4% |
| 6 | 8992 | 8.2% |
| 9 | 7389 | 6.8% |
| 4 | 5312 | 4.9% |
| 5 | 5213 | 4.8% |
| 10 | 359 | 0.3% |
| (Missing) | 2311 | 2.1% |
| Value | Count | Frequency (%) |
| 1 | 14963 | |
| 2 | 27540 | |
| 3 | 10926 | 10.0% |
| 4 | 5312 | 4.9% |
| 5 | 5213 | 4.8% |
| 6 | 8992 | 8.2% |
| 7 | 15827 | |
| 8 | 10295 | 9.4% |
| 9 | 7389 | 6.8% |
| 10 | 359 | 0.3% |
| Value | Count | Frequency (%) |
| 10 | 359 | 0.3% |
| 9 | 7389 | 6.8% |
| 8 | 10295 | 9.4% |
| 7 | 15827 | |
| 6 | 8992 | 8.2% |
| 5 | 5213 | 4.8% |
| 4 | 5312 | 4.9% |
| 3 | 10926 | 10.0% |
| 2 | 27540 | |
| 1 | 14963 |
feature_006
Real number (ℝ)
High correlation Zeros
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 896 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.4397723 |
| Minimum | 0 |
|---|---|
| Maximum | 10 |
| Zeros | 18351 |
| Zeros (%) | 16.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 6 |
| Q3 | 7 |
| 95-th percentile | 8 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.0179774 |
|---|---|
| Coefficient of variation (CV) | 0.6797595 |
| Kurtosis | -1.4804395 |
| Mean | 4.4397723 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.32735666 |
| Sum | 480521 |
| Variance | 9.1081877 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 22151 | |
| 6 | 19333 | |
| 0 | 18351 | |
| 1 | 13577 | |
| 8 | 13340 | |
| 5 | 7373 | 6.8% |
| 2 | 6923 | 6.3% |
| 3 | 2680 | 2.5% |
| 9 | 2282 | 2.1% |
| 4 | 2055 | 1.9% |
| (Missing) | 896 | 0.8% |
| Value | Count | Frequency (%) |
| 0 | 18351 | |
| 1 | 13577 | |
| 2 | 6923 | 6.3% |
| 3 | 2680 | 2.5% |
| 4 | 2055 | 1.9% |
| 5 | 7373 | 6.8% |
| 6 | 19333 | |
| 7 | 22151 | |
| 8 | 13340 | |
| 9 | 2282 | 2.1% |
| Value | Count | Frequency (%) |
| 10 | 166 | 0.2% |
| 9 | 2282 | 2.1% |
| 8 | 13340 | |
| 7 | 22151 | |
| 6 | 19333 | |
| 5 | 7373 | 6.8% |
| 4 | 2055 | 1.9% |
| 3 | 2680 | 2.5% |
| 2 | 6923 | 6.3% |
| 1 | 13577 |
feature_007
Categorical
Imbalance Missing
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2343 |
| Missing (%) | 2.1% |
| Memory size | 7.1 MiB |
| 1.0 | |
|---|---|
| 0.0 | 7962 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 98822 | |
| 0.0 | 7962 | 7.3% |
| (Missing) | 2343 | 2.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 98822 | |
| 0.0 | 7962 | 7.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 114746 | |
| . | 106784 | |
| 1 | 98822 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 320352 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 114746 | |
| . | 106784 | |
| 1 | 98822 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 320352 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 114746 | |
| . | 106784 | |
| 1 | 98822 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 320352 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 114746 | |
| . | 106784 | |
| 1 | 98822 |
feature_008
Real number (ℝ)
High correlation Missing Zeros
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2452 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.6247012 |
| Minimum | 0 |
|---|---|
| Maximum | 10 |
| Zeros | 31159 |
| Zeros (%) | 28.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 9 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 4.0071017 |
|---|---|
| Coefficient of variation (CV) | 1.1054985 |
| Kurtosis | -1.5133556 |
| Mean | 3.6247012 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.57578216 |
| Sum | 386665 |
| Variance | 16.056864 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 31159 | |
| 1 | 30688 | |
| 9 | 18166 | |
| 10 | 9733 | 8.9% |
| 8 | 6738 | 6.2% |
| 2 | 3667 | 3.4% |
| 7 | 2047 | 1.9% |
| 3 | 1588 | 1.5% |
| 6 | 1424 | 1.3% |
| 4 | 1047 | 1.0% |
| (Missing) | 2452 | 2.2% |
| Value | Count | Frequency (%) |
| 0 | 31159 | |
| 1 | 30688 | |
| 2 | 3667 | 3.4% |
| 3 | 1588 | 1.5% |
| 4 | 1047 | 1.0% |
| 5 | 418 | 0.4% |
| 6 | 1424 | 1.3% |
| 7 | 2047 | 1.9% |
| 8 | 6738 | 6.2% |
| 9 | 18166 |
| Value | Count | Frequency (%) |
| 10 | 9733 | 8.9% |
| 9 | 18166 | |
| 8 | 6738 | 6.2% |
| 7 | 2047 | 1.9% |
| 6 | 1424 | 1.3% |
| 5 | 418 | 0.4% |
| 4 | 1047 | 1.0% |
| 3 | 1588 | 1.5% |
| 2 | 3667 | 3.4% |
| 1 | 30688 |
feature_009
Real number (ℝ)
High correlation Missing Zeros
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2452 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.4374315 |
| Minimum | 0 |
|---|---|
| Maximum | 10 |
| Zeros | 71139 |
| Zeros (%) | 65.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 8 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 3.9597776 |
|---|---|
| Coefficient of variation (CV) | 1.6245698 |
| Kurtosis | -0.72600408 |
| Mean | 2.4374315 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.106461 |
| Sum | 260013 |
| Variance | 15.679839 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 71139 | |
| 9 | 20649 | 18.9% |
| 1 | 6591 | 6.0% |
| 10 | 5414 | 5.0% |
| 2 | 1434 | 1.3% |
| 8 | 1049 | 1.0% |
| 7 | 171 | 0.2% |
| 3 | 79 | 0.1% |
| 6 | 65 | 0.1% |
| 4 | 63 | 0.1% |
| (Missing) | 2452 | 2.2% |
| Value | Count | Frequency (%) |
| 0 | 71139 | |
| 1 | 6591 | 6.0% |
| 2 | 1434 | 1.3% |
| 3 | 79 | 0.1% |
| 4 | 63 | 0.1% |
| 5 | 21 | < 0.1% |
| 6 | 65 | 0.1% |
| 7 | 171 | 0.2% |
| 8 | 1049 | 1.0% |
| 9 | 20649 | 18.9% |
| Value | Count | Frequency (%) |
| 10 | 5414 | 5.0% |
| 9 | 20649 | |
| 8 | 1049 | 1.0% |
| 7 | 171 | 0.2% |
| 6 | 65 | 0.1% |
| 5 | 21 | < 0.1% |
| 4 | 63 | 0.1% |
| 3 | 79 | 0.1% |
| 2 | 1434 | 1.3% |
| 1 | 6591 | 6.0% |
feature_010
Real number (ℝ)
High correlation Missing Skewed Zeros
| Distinct | 5667 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 2452 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 662.79363 |
| Minimum | 0 |
|---|---|
| Maximum | 2899149 |
| Zeros | 51479 |
| Zeros (%) | 47.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 3 |
| Q3 | 512 |
| 95-th percentile | 2254 |
| Maximum | 2899149 |
| Range | 2899149 |
| Interquartile range (IQR) | 512 |
Descriptive statistics
| Standard deviation | 13114.468 |
|---|---|
| Coefficient of variation (CV) | 19.786654 |
| Kurtosis | 37487.184 |
| Mean | 662.79363 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 182.15614 |
| Sum | 70703510 |
| Variance | 1.7198927 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 51479 | |
| 1 | 1001 | 0.9% |
| 2 | 604 | 0.6% |
| 3 | 449 | 0.4% |
| 4 | 320 | 0.3% |
| 5 | 295 | 0.3% |
| 7 | 244 | 0.2% |
| 6 | 237 | 0.2% |
| 8 | 211 | 0.2% |
| 9 | 196 | 0.2% |
| Other values (5657) | 51639 | |
| (Missing) | 2452 | 2.2% |
| Value | Count | Frequency (%) |
| 0 | 51479 | |
| 1 | 1001 | 0.9% |
| 2 | 604 | 0.6% |
| 3 | 449 | 0.4% |
| 4 | 320 | 0.3% |
| 5 | 295 | 0.3% |
| 6 | 237 | 0.2% |
| 7 | 244 | 0.2% |
| 8 | 211 | 0.2% |
| 9 | 196 | 0.2% |
| Value | Count | Frequency (%) |
| 2899149 | 1 | |
| 2611918 | 1 | |
| 894470 | 1 | |
| 782980 | 1 | |
| 591000 | 1 | |
| 402933 | 1 | |
| 396166 | 1 | |
| 323580 | 1 | |
| 286122 | 1 | |
| 242488 | 1 |
feature_011
Categorical
Imbalance Missing
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2336 |
| Missing (%) | 2.1% |
| Memory size | 7.1 MiB |
| 0.0 | |
|---|---|
| 1.0 | 1102 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 105689 | |
| 1.0 | 1102 | 1.0% |
| (Missing) | 2336 | 2.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 105689 | |
| 1.0 | 1102 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 212480 | |
| . | 106791 | |
| 1 | 1102 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 320373 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 212480 | |
| . | 106791 | |
| 1 | 1102 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 320373 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 212480 | |
| . | 106791 | |
| 1 | 1102 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 320373 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 212480 | |
| . | 106791 | |
| 1 | 1102 | 0.3% |
feature_012
Categorical
Imbalance Missing
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4516 |
| Missing (%) | 4.1% |
| Memory size | 7.1 MiB |
| 0.0 | |
|---|---|
| 1.0 | |
| 2.0 | 5494 |
| 3.0 | 3873 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 84516 | |
| 1.0 | 10728 | 9.8% |
| 2.0 | 5494 | 5.0% |
| 3.0 | 3873 | 3.5% |
| (Missing) | 4516 | 4.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 84516 | |
| 1.0 | 10728 | 10.3% |
| 2.0 | 5494 | 5.3% |
| 3.0 | 3873 | 3.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 189127 | |
| . | 104611 | |
| 1 | 10728 | 3.4% |
| 2 | 5494 | 1.8% |
| 3 | 3873 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 313833 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 189127 | |
| . | 104611 | |
| 1 | 10728 | 3.4% |
| 2 | 5494 | 1.8% |
| 3 | 3873 | 1.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 313833 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 189127 | |
| . | 104611 | |
| 1 | 10728 | 3.4% |
| 2 | 5494 | 1.8% |
| 3 | 3873 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 313833 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 189127 | |
| . | 104611 | |
| 1 | 10728 | 3.4% |
| 2 | 5494 | 1.8% |
| 3 | 3873 | 1.2% |
feature_013
Categorical
Missing
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2336 |
| Missing (%) | 2.1% |
| Memory size | 7.1 MiB |
| 1.0 | |
|---|---|
| 0.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 0.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 57892 | |
| 0.0 | 48899 | |
| (Missing) | 2336 | 2.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 57892 | |
| 0.0 | 48899 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 155690 | |
| . | 106791 | |
| 1 | 57892 | 18.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 320373 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 155690 | |
| . | 106791 | |
| 1 | 57892 | 18.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 320373 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 155690 | |
| . | 106791 | |
| 1 | 57892 | 18.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 320373 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 155690 | |
| . | 106791 | |
| 1 | 57892 | 18.1% |
feature_014
Categorical
High correlation Missing
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2355 |
| Missing (%) | 2.2% |
| Memory size | 7.1 MiB |
| 1.0 | |
|---|---|
| 0.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 89071 | |
| 0.0 | 17701 | 16.2% |
| (Missing) | 2355 | 2.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 89071 | |
| 0.0 | 17701 | 16.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 124473 | |
| . | 106772 | |
| 1 | 89071 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 320316 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 124473 | |
| . | 106772 | |
| 1 | 89071 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 320316 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 124473 | |
| . | 106772 | |
| 1 | 89071 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 320316 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 124473 | |
| . | 106772 | |
| 1 | 89071 |
feature_015
Real number (ℝ)
| Distinct | 68340 |
|---|---|
| Distinct (%) | 62.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.121775 |
| Minimum | -0.87395833 |
|---|---|
| Maximum | 894.20021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 1 |
| Negative (%) | < 0.1% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | -0.87395833 |
|---|---|
| 5-th percentile | 0.025555556 |
| Q1 | 0.039699074 |
| median | 0.71013889 |
| Q3 | 13.882876 |
| 95-th percentile | 133.4555 |
| Maximum | 894.20021 |
| Range | 895.07417 |
| Interquartile range (IQR) | 13.843177 |
Descriptive statistics
| Standard deviation | 72.961014 |
|---|---|
| Coefficient of variation (CV) | 2.9042938 |
| Kurtosis | 41.768502 |
| Mean | 25.121775 |
| Median Absolute Deviation (MAD) | 0.68385417 |
| Skewness | 5.7337383 |
| Sum | 2741463.9 |
| Variance | 5323.3096 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.03252314815 | 38 | < 0.1% |
| 0.03268518519 | 37 | < 0.1% |
| 0.03233796296 | 34 | < 0.1% |
| 0.03256944444 | 34 | < 0.1% |
| 0.03217592593 | 34 | < 0.1% |
| 0.03181712963 | 33 | < 0.1% |
| 0.03047453704 | 33 | < 0.1% |
| 0.03359953704 | 33 | < 0.1% |
| 0.03231481481 | 33 | < 0.1% |
| 0.03513888889 | 32 | < 0.1% |
| Other values (68330) | 108786 |
| Value | Count | Frequency (%) |
| -0.8739583333 | 1 | |
| 0.01414351852 | 1 | |
| 0.01456018519 | 1 | |
| 0.0146412037 | 1 | |
| 0.0147337963 | 1 | |
| 0.01482638889 | 1 | |
| 0.01493055556 | 1 | |
| 0.01503472222 | 1 | |
| 0.01513888889 | 1 | |
| 0.01517361111 | 1 |
| Value | Count | Frequency (%) |
| 894.2002083 | 1 | |
| 893.9557407 | 1 | |
| 888.6703125 | 1 | |
| 882.6855093 | 1 | |
| 867.405 | 1 | |
| 866.7427894 | 1 | |
| 866.2371528 | 1 | |
| 865.8012847 | 1 | |
| 865.5734491 | 1 | |
| 865.2211343 | 1 |
feature_016
Real number (ℝ)
Zeros
| Distinct | 139 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.7829868 |
| Minimum | 0 |
|---|---|
| Maximum | 326 |
| Zeros | 11378 |
| Zeros (%) | 10.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 10 |
| Maximum | 326 |
| Range | 326 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 6.2950305 |
|---|---|
| Coefficient of variation (CV) | 2.2619692 |
| Kurtosis | 279.70011 |
| Mean | 2.7829868 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 11.932537 |
| Sum | 303699 |
| Variance | 39.62741 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 55792 | |
| 2 | 16023 | 14.7% |
| 0 | 11378 | 10.4% |
| 3 | 7516 | 6.9% |
| 4 | 4276 | 3.9% |
| 5 | 2908 | 2.7% |
| 6 | 2052 | 1.9% |
| 7 | 1542 | 1.4% |
| 8 | 1074 | 1.0% |
| 9 | 879 | 0.8% |
| Other values (129) | 5687 | 5.2% |
| Value | Count | Frequency (%) |
| 0 | 11378 | 10.4% |
| 1 | 55792 | |
| 2 | 16023 | 14.7% |
| 3 | 7516 | 6.9% |
| 4 | 4276 | 3.9% |
| 5 | 2908 | 2.7% |
| 6 | 2052 | 1.9% |
| 7 | 1542 | 1.4% |
| 8 | 1074 | 1.0% |
| 9 | 879 | 0.8% |
| Value | Count | Frequency (%) |
| 326 | 1 | |
| 268 | 1 | |
| 256 | 1 | |
| 231 | 1 | |
| 203 | 1 | |
| 199 | 1 | |
| 195 | 1 | |
| 185 | 1 | |
| 181 | 1 | |
| 176 | 1 |
feature_017
Real number (ℝ)
Missing
| Distinct | 30759 |
|---|---|
| Distinct (%) | 31.5% |
| Missing | 11378 |
| Missing (%) | 10.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.80772 |
| Minimum | 0 |
|---|---|
| Maximum | 24 |
| Zeros | 18 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1.9333333 |
| Q1 | 9.6166667 |
| median | 14.666667 |
| Q3 | 18.683439 |
| 95-th percentile | 22.546094 |
| Maximum | 24 |
| Range | 24 |
| Interquartile range (IQR) | 9.0667722 |
Descriptive statistics
| Standard deviation | 6.1495907 |
|---|---|
| Coefficient of variation (CV) | 0.44537335 |
| Kurtosis | -0.67589661 |
| Mean | 13.80772 |
| Median Absolute Deviation (MAD) | 4.45 |
| Skewness | -0.45046201 |
| Sum | 1349690.9 |
| Variance | 37.817466 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 15.31666667 | 87 | 0.1% |
| 16.63333333 | 84 | 0.1% |
| 18.48333333 | 82 | 0.1% |
| 14.63333333 | 81 | 0.1% |
| 17.15 | 81 | 0.1% |
| 16.26666667 | 81 | 0.1% |
| 18.05 | 80 | 0.1% |
| 16.71666667 | 80 | 0.1% |
| 17.68333333 | 79 | 0.1% |
| 14.66666667 | 79 | 0.1% |
| Other values (30749) | 96935 | |
| (Missing) | 11378 | 10.4% |
| Value | Count | Frequency (%) |
| 0 | 18 | |
| 0.0001312630792 | 1 | < 0.1% |
| 0.001016023877 | 1 | < 0.1% |
| 0.001892308703 | 1 | < 0.1% |
| 0.004145499675 | 1 | < 0.1% |
| 0.004354619702 | 1 | < 0.1% |
| 0.005328150353 | 1 | < 0.1% |
| 0.005586209262 | 1 | < 0.1% |
| 0.005771869028 | 1 | < 0.1% |
| 0.008333333333 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 24 | 6 | |
| 23.99800237 | 1 | < 0.1% |
| 23.99495856 | 1 | < 0.1% |
| 23.99444413 | 1 | < 0.1% |
| 23.99301571 | 1 | < 0.1% |
| 23.99166667 | 3 | |
| 23.99166667 | 2 | < 0.1% |
| 23.99131866 | 1 | < 0.1% |
| 23.99043087 | 1 | < 0.1% |
| 23.98674592 | 1 | < 0.1% |
feature_018
Real number (ℝ)
Missing Zeros
| Distinct | 11669 |
|---|---|
| Distinct (%) | 11.9% |
| Missing | 11181 |
| Missing (%) | 10.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.57458319 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 2012 |
| Zeros (%) | 1.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.15000001 |
| Q1 | 0.42600001 |
| median | 0.60000002 |
| Q3 | 0.75 |
| 95-th percentile | 0.89999998 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.32399999 |
Descriptive statistics
| Standard deviation | 0.22776338 |
|---|---|
| Coefficient of variation (CV) | 0.39639757 |
| Kurtosis | -0.29917929 |
| Mean | 0.57458319 |
| Median Absolute Deviation (MAD) | 0.15000004 |
| Skewness | -0.49464808 |
| Sum | 56278.125 |
| Variance | 0.051876156 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.5 | 4234 | 3.9% |
| 0.75 | 3878 | 3.6% |
| 0.8000000119 | 3062 | 2.8% |
| 0.6000000238 | 2666 | 2.4% |
| 0 | 2012 | 1.8% |
| 0.5500000119 | 1907 | 1.7% |
| 0.6999999881 | 1823 | 1.7% |
| 0.8500000238 | 1721 | 1.6% |
| 0.3000000119 | 1708 | 1.6% |
| 0.400000006 | 1659 | 1.5% |
| Other values (11659) | 73276 | |
| (Missing) | 11181 | 10.2% |
| Value | Count | Frequency (%) |
| 0 | 2012 | |
| 0.004545450211 | 1 | < 0.1% |
| 0.004999999888 | 5 | < 0.1% |
| 0.006024099886 | 3 | < 0.1% |
| 0.009090909734 | 1 | < 0.1% |
| 0.009100000374 | 1 | < 0.1% |
| 0.009999999776 | 22 | < 0.1% |
| 0.01109999977 | 25 | < 0.1% |
| 0.0111111002 | 8 | < 0.1% |
| 0.01136364974 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 1478 | |
| 0.9962963263 | 1 | < 0.1% |
| 0.9950000048 | 1 | < 0.1% |
| 0.9944440126 | 6 | < 0.1% |
| 0.9900000095 | 22 | < 0.1% |
| 0.9889000058 | 1 | < 0.1% |
| 0.9888890088 | 1 | < 0.1% |
| 0.988888979 | 24 | < 0.1% |
| 0.9850000143 | 10 | < 0.1% |
| 0.9833334982 | 2 | < 0.1% |
high_conf_clean
Categorical
Constant Missing
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 45186 |
| Missing (%) | 41.4% |
| Memory size | 6.9 MiB |
| 1.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 63941 | |
| (Missing) | 45186 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 63941 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 63941 | |
| . | 63941 | |
| 0 | 63941 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 191823 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 63941 | |
| . | 63941 | |
| 0 | 63941 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 191823 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 63941 | |
| . | 63941 | |
| 0 | 63941 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 191823 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 63941 | |
| . | 63941 | |
| 0 | 63941 |
is_cheating
Categorical
Missing
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 63941 |
| Missing (%) | 58.6% |
| Memory size | 6.8 MiB |
| 0.0 | |
|---|---|
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 31414 | |
| 1.0 | 13772 | 12.6% |
| (Missing) | 63941 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 31414 | |
| 1.0 | 13772 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 76600 | |
| . | 45186 | |
| 1 | 13772 | 10.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 135558 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 76600 | |
| . | 45186 | |
| 1 | 13772 | 10.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 135558 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 76600 | |
| . | 45186 | |
| 1 | 13772 | 10.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 135558 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 76600 | |
| . | 45186 | |
| 1 | 13772 | 10.2% |
Interactions
Correlations
| feature_001 | feature_002 | feature_003 | feature_004 | feature_005 | feature_006 | feature_007 | feature_008 | feature_009 | feature_010 | feature_011 | feature_012 | feature_013 | feature_014 | feature_015 | feature_016 | feature_017 | feature_018 | is_cheating | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| feature_001 | 1.000 | 0.252 | 0.251 | 0.185 | 0.085 | 0.130 | 0.019 | 0.037 | 0.058 | 0.000 | 0.028 | 0.061 | 0.111 | 0.279 | 0.017 | 0.004 | 0.056 | 0.111 | 0.145 |
| feature_002 | 0.252 | 1.000 | -0.329 | 0.255 | 0.132 | 0.057 | 0.017 | -0.035 | -0.048 | 0.002 | 0.051 | 0.067 | 0.120 | 0.143 | 0.026 | 0.005 | -0.052 | -0.042 | 0.176 |
| feature_003 | 0.251 | -0.329 | 1.000 | -0.436 | -0.121 | -0.128 | 0.022 | 0.028 | 0.044 | 0.012 | 0.011 | 0.056 | 0.125 | 0.203 | 0.022 | 0.088 | 0.118 | 0.351 | 0.098 |
| feature_004 | 0.185 | 0.255 | -0.436 | 1.000 | 0.130 | 0.111 | 0.013 | -0.038 | -0.036 | 0.004 | 0.040 | 0.072 | 0.173 | 0.096 | 0.041 | 0.099 | -0.116 | -0.331 | 0.251 |
| feature_005 | 0.085 | 0.132 | -0.121 | 0.130 | 1.000 | -0.009 | 0.030 | -0.090 | -0.064 | -0.021 | 0.060 | 0.094 | 0.074 | 0.096 | -0.008 | 0.033 | 0.009 | -0.043 | 0.143 |
| feature_006 | 0.130 | 0.057 | -0.128 | 0.111 | -0.009 | 1.000 | 0.018 | 0.007 | 0.192 | 0.018 | 0.009 | 0.034 | 0.064 | 0.694 | 0.020 | 0.106 | -0.062 | -0.040 | 0.158 |
| feature_007 | 0.019 | 0.017 | 0.022 | 0.013 | 0.030 | 0.018 | 1.000 | 0.238 | 0.170 | 0.000 | 0.000 | 0.042 | 0.011 | 0.025 | 0.022 | 0.004 | 0.000 | 0.008 | 0.042 |
| feature_008 | 0.037 | -0.035 | 0.028 | -0.038 | -0.090 | 0.007 | 0.238 | 1.000 | 0.610 | 0.698 | 0.025 | 0.061 | 0.041 | 0.069 | 0.010 | -0.003 | -0.004 | 0.007 | 0.117 |
| feature_009 | 0.058 | -0.048 | 0.044 | -0.036 | -0.064 | 0.192 | 0.170 | 0.610 | 1.000 | 0.604 | 0.026 | 0.041 | 0.033 | 0.269 | 0.012 | 0.040 | -0.008 | 0.022 | 0.116 |
| feature_010 | 0.000 | 0.002 | 0.012 | 0.004 | -0.021 | 0.018 | 0.000 | 0.698 | 0.604 | 1.000 | 0.000 | 0.004 | 0.007 | 0.007 | 0.031 | 0.010 | -0.015 | 0.006 | 0.010 |
| feature_011 | 0.028 | 0.051 | 0.011 | 0.040 | 0.060 | 0.009 | 0.000 | 0.025 | 0.026 | 0.000 | 1.000 | 0.037 | 0.037 | 0.005 | 0.009 | 0.025 | 0.021 | 0.022 | 0.079 |
| feature_012 | 0.061 | 0.067 | 0.056 | 0.072 | 0.094 | 0.034 | 0.042 | 0.061 | 0.041 | 0.004 | 0.037 | 1.000 | 0.056 | 0.064 | 0.011 | 0.000 | 0.014 | 0.022 | 0.140 |
| feature_013 | 0.111 | 0.120 | 0.125 | 0.173 | 0.074 | 0.064 | 0.011 | 0.041 | 0.033 | 0.007 | 0.037 | 0.056 | 1.000 | 0.053 | 0.024 | 0.007 | 0.056 | 0.069 | 0.210 |
| feature_014 | 0.279 | 0.143 | 0.203 | 0.096 | 0.096 | 0.694 | 0.025 | 0.069 | 0.269 | 0.007 | 0.005 | 0.064 | 0.053 | 1.000 | 0.058 | 0.033 | 0.033 | 0.150 | 0.195 |
| feature_015 | 0.017 | 0.026 | 0.022 | 0.041 | -0.008 | 0.020 | 0.022 | 0.010 | 0.012 | 0.031 | 0.009 | 0.011 | 0.024 | 0.058 | 1.000 | 0.224 | 0.032 | 0.049 | 0.133 |
| feature_016 | 0.004 | 0.005 | 0.088 | 0.099 | 0.033 | 0.106 | 0.004 | -0.003 | 0.040 | 0.010 | 0.025 | 0.000 | 0.007 | 0.033 | 0.224 | 1.000 | 0.044 | 0.097 | 0.026 |
| feature_017 | 0.056 | -0.052 | 0.118 | -0.116 | 0.009 | -0.062 | 0.000 | -0.004 | -0.008 | -0.015 | 0.021 | 0.014 | 0.056 | 0.033 | 0.032 | 0.044 | 1.000 | 0.073 | 0.078 |
| feature_018 | 0.111 | -0.042 | 0.351 | -0.331 | -0.043 | -0.040 | 0.008 | 0.007 | 0.022 | 0.006 | 0.022 | 0.022 | 0.069 | 0.150 | 0.049 | 0.097 | 0.073 | 1.000 | 0.131 |
| is_cheating | 0.145 | 0.176 | 0.098 | 0.251 | 0.143 | 0.158 | 0.042 | 0.117 | 0.116 | 0.010 | 0.079 | 0.140 | 0.210 | 0.195 | 0.133 | 0.026 | 0.078 | 0.131 | 1.000 |
Missing values
Sample
| user_hash | feature_001 | feature_002 | feature_003 | feature_004 | feature_005 | feature_006 | feature_007 | feature_008 | feature_009 | feature_010 | feature_011 | feature_012 | feature_013 | feature_014 | feature_015 | feature_016 | feature_017 | feature_018 | high_conf_clean | is_cheating | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 34230 | d558f000c89bed2a | 1.0 | 2.0 | 7.0 | 7.0 | 1.0 | 5.0 | 1.0 | 1.0 | 1.0 | 21915.0 | 0.0 | 0.0 | 1.0 | 1.0 | 0.031157 | 1 | 10.933333 | 0.500000 | 1.0 | NaN |
| 140301 | 1e9668e66b1d6253 | 1.0 | 3.0 | 5.0 | 1.0 | 6.0 | 5.0 | 1.0 | 8.0 | 9.0 | 304.0 | 0.0 | 0.0 | 1.0 | 1.0 | 0.030023 | 0 | NaN | 0.450000 | 1.0 | NaN |
| 143406 | c0e9e1e943261b5d | 1.0 | 2.0 | 7.0 | 1.0 | 3.0 | 7.0 | 1.0 | 9.0 | 9.0 | 242.0 | 0.0 | 0.0 | 1.0 | 1.0 | 0.037963 | 0 | NaN | 0.500000 | 1.0 | NaN |
| 101176 | df3d93556a7b915d | 3.0 | 2.0 | 7.0 | 9.0 | 2.0 | 7.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 10.409340 | 2 | 16.841667 | 0.450000 | 1.0 | NaN |
| 244995 | 302265f9f7d11aa1 | 1.0 | 2.0 | 6.0 | 7.0 | 3.0 | 8.0 | 1.0 | 9.0 | 9.0 | 4.0 | 0.0 | 1.0 | 1.0 | 1.0 | 21.321968 | 15 | 12.822378 | 0.544583 | 1.0 | NaN |
| 45102 | 47afc1c972264712 | 1.0 | 2.0 | 8.0 | 1.0 | 2.0 | 0.0 | 1.0 | 9.0 | 0.0 | 1595.0 | 0.0 | 0.0 | 1.0 | 0.0 | 29.042234 | 1 | 17.283333 | 0.700000 | 1.0 | NaN |
| 250921 | 89fd8827b3ced4fd | 1.0 | 6.0 | 3.0 | 1.0 | 6.0 | 6.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.030451 | 4 | 19.050941 | 0.530000 | 1.0 | NaN |
| 204269 | 2ee0108fc5b82c63 | 1.0 | 2.0 | 9.0 | 1.0 | 7.0 | 1.0 | 1.0 | 1.0 | 0.0 | 323580.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.091759 | 1 | 10.600000 | 0.750000 | 1.0 | NaN |
| 269514 | ffdaf5eed5cd46c1 | 2.0 | 2.0 | 8.0 | 1.0 | 7.0 | 1.0 | 1.0 | 10.0 | 10.0 | 385.0 | 0.0 | 0.0 | 1.0 | 1.0 | 30.094421 | 2 | 22.891667 | 0.532500 | 1.0 | NaN |
| 120181 | 85e5a153f1eb0870 | 4.0 | 2.0 | 9.0 | 1.0 | 2.0 | 6.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 | 100.904769 | 4 | 20.310662 | 0.750000 | 1.0 | NaN |
| user_hash | feature_001 | feature_002 | feature_003 | feature_004 | feature_005 | feature_006 | feature_007 | feature_008 | feature_009 | feature_010 | feature_011 | feature_012 | feature_013 | feature_014 | feature_015 | feature_016 | feature_017 | feature_018 | high_conf_clean | is_cheating | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 145125 | ab432b915912747c | 5.0 | 8.0 | 6.0 | 8.0 | NaN | 0.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 23.803530 | 1 | 17.666667 | 0.475000 | NaN | 1.0 |
| 49676 | 64a53fa59e79a770 | NaN | NaN | NaN | NaN | 8.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | NaN | 0.0 | 0.0 | 42.123009 | 0 | NaN | NaN | NaN | 1.0 |
| 29382 | bd4a4c22e3fb9530 | 5.0 | 7.0 | 4.0 | 1.0 | 7.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.030752 | 1 | 16.883333 | 0.250000 | NaN | 1.0 |
| 53368 | 2409a2de9b418e5e | 5.0 | 8.0 | 3.0 | 9.0 | 1.0 | 8.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 389.813090 | 0 | NaN | 0.100000 | NaN | 1.0 |
| 260703 | 515f3726bddee1ef | NaN | NaN | NaN | 3.0 | 3.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 101.239201 | 1 | 12.750000 | NaN | NaN | 1.0 |
| 228852 | 34438d53cdc38c81 | 5.0 | 8.0 | 4.0 | 9.0 | 5.0 | 0.0 | 1.0 | 1.0 | 0.0 | 1121.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.922396 | 0 | NaN | 0.444400 | NaN | 1.0 |
| 35011 | 14bd76ccf956ca41 | 5.0 | 8.0 | 7.0 | 10.0 | 3.0 | 7.0 | 1.0 | 5.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 89.923669 | 1 | 15.616667 | 0.340555 | NaN | 1.0 |
| 156692 | 9777d61d6267e106 | NaN | NaN | NaN | NaN | 3.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 97.321169 | 1 | 11.050000 | NaN | NaN | 1.0 |
| 119644 | 2eda92d27c730685 | 5.0 | 8.0 | 4.0 | 9.0 | 9.0 | 7.0 | 1.0 | 1.0 | 1.0 | 78.0 | 0.0 | 2.0 | 0.0 | 1.0 | 0.034965 | 2 | 3.683333 | 0.850000 | NaN | 1.0 |
| 189503 | f0028eb1ed639989 | 5.0 | 2.0 | 2.0 | 8.0 | 6.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | 1.0 | 0.0 | 0.024931 | 1 | 20.950000 | 0.133300 | NaN | 1.0 |